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Abstract. Selman and Kautz's work on "knowledge compilation" estab- 
lished how approximation (strengthening and/or weakening) of a propo- 
sitional knowledge-base can be used to speed up query processing, at the 
expense of completeness. In this classical approach, querying uses Horn 
over- and under- approximations of a given knowledge-base, which is rep- 
resented as a propositional formula in conjunctive normal form (CNF). 
Along with the class of Horn functions, one could imagine other Boolean 
function classes that might serve the same purpose, owing to attractive 
deduction-computational properties similar to those of the Horn func- 
tions. Indeed, Zanuttini has suggested that the class of affine Boolean 
functions could be useful in knowledge compilation and has presented 
an affine approximation algorithm. Since CNF is awkward for presenting 
affine functions, Zanuttini considers both a sets-of-models representation 
and the use of modulo 2 congruence equations. In this paper, we pro- 
pose an algorithm based on reduced ordered binary decision diagrams 
(ROBDDs). This leads to a representation which is more compact than 
the sets of models and, once we have established some useful properties 
of affine Boolean functions, a more efficient algorithm. 



1 Introduction 

A recurrent theme in artificial intelligence is the efficient use of (propositional) 
knowledge-bases. A promising approach, which was initially proposed by Selman 
and Kautz [11], is to query (and perform deductions from) upper and lower 
approximations, commonly called envelopes and cores respectively, of a given 
knowledge-base. By choosing approximations that allow more efficient inference, 
it is often possible to quickly determine that the envelope of the given knowledge- 
base entails the query, and therefore so does the full knowledge-base, avoiding 
the costly inference from the full knowledge-base. When this fails, it may be 
possible to quickly show that the query is not entailed by the core, and therefore 
not entailed by the full knowledge-base. Only when both of these fail must the 
full knowledge-base be used for inference. 

It is usually assumed that Boolean functions are represented in clausal form, 
and that approximations are Horn [11,5], as inference from Horn knowledge- 
bases is exponentially more efficient than from unrestricted knowledge-bases. 
However, it has been noted that there are other well-understood classes that 



have computational properties that include some of the attractive properties of 
the Horn class. 

Zanuttini [12, 13] discusses the use of other classes of Boolean functions for 
approximation and points out that affine approximations have certain advan- 
tages over Horn approximations, most notably the fact that they do not blow 
out in size. This is certainly the case when afhne functions are represented in 
the form of modulo-2 congruence equations. The more general sets-of-models 
representation is also considered by Zanuttini. In this paper, we consider an- 
other general representation, namely the well-known Reduced Ordered Binary 
Decision Diagrams (ROBDDs). We prove some important properties of afSne 
functions represented as ROBDDs, and present a new ROBDD algorithm for 
deriving aflinc envelopes. 

The balance of the paper proceeds as follows. In Section 2 we recapitulate the 
definition of the Boolean afhne class, and we establish some of their important 
properties. Wc also briefly introduce ROBDDs, but mainly to fix our notation, 
as we assume that the reader is familiar with Boolean functions and their repre- 
sentation as decision diagrams. Section 3 recalls the model-based affine envelope 
algorithm, and develops our own ROBDD-based algorithm, along with a correct- 
ness proof. Section 4 describes our testing methodology, including our algorithm 
for generating random ROBDDs, and presents our results. Section 5 discusses 
related work and applications, and concludes. 

2 Prepositional Classes, Approximation and ROBDDs 

We use ROBDDs [1, 2] to represent Boolean functions. Horiyama and Ibaraki [6] 

have recommended ROBDDs as suitable for implementing knowledge bases. Our 
choice of ROBDDs as a data structure was not so much influenced by that rec- 
ommendation, as by the convenience of working with a canonical representation 
for Boolean functions, and one that lends itself to inductive reasoning and re- 
cursive problem solving. Additionally, ROBDD-based inference is fast, and in 
particular, checking whether a valuation is a model of an n-place function given 
by an ROBDD requires a path traversal of length no more than n. 

ROBDD algorithms for approximation are of interest in their own right and 
some find applications in dataflow analysis [8]. Prom this aspect, this paper 
continues earlier work by Schachte and S0ndergaard [8,9] who gave algorithms 
for finding monotone, Krom, and Horn envelopes. Here we introduce an ROBDD 
algorithm for affine envelopes, which is new. 

2.1 Boolean functions 

Let B = {0,1} and let V be a denumerable set of variables. A valuation ix:V ^ B 
is a (total) assignment of truth values to the variables in V. Let T = V — > B 
denote the set of V-valuations. A partial valuation : V — > ;B U {_L} assigns 
truth values to some variables in V, and _L to others. Let Xp = V — > B U {-L}. 
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We use the notation ijl[x h- > i], where x & V and i £ B, to denote the valuation 
fj, updated to map x to i, that is, 



A Boolean function over V is a function cp : X ^ B. We let B denote the 
set of all Boolean functions over V. The ordering on B is the usual: x < y iS 
X = 0\/ y = 1. is ordered pointwise, so that the ordering relation corresponds 
exactly to classical entailment, |=. It is convenient to overload the symbols for 
truth and falsehood. Thus we let 1 denote the largest element of B (that is, 
A/zJ) as well as of B. Similarly denotes the smallest element of B (that is, 
X^i.O) as well as of B. A valuation /i is a model for (p, denoted /i |= ip, if (p{j-i) = 1. 
We let models{(f) denote the set of models of (f. Conversely, the unique Boolean 
function that has exactly the set M as models is denoted fn{M). A Boolean 
function ip is said to be independent of a variable x when for all valuations /x, 
IJ.[x i-^ 0]\= (f iS n[x i-^ 1] \= (fi. 

In the context of an ordered set of k variables of interest, xi,. . . ,Xk, we may 
identify with /x the binary sequence bits(/i) of length k: 

H{x-i),...,n{xk) 

which we will write simply as a bit-string of length k. Similarly we may think 
of, and write, the set of valuations M as a set of bit-strings: 

bits(M) = {bits(/i) | M e M} 

As it hardly creates confusion, we shall present valuations variously as functions 
or bitstrings. We denote the zero valuation, which maps Xi to for all 1 < i < A;, 
by 0. 

We use the Boolean connectives (negation), A (conjunction), V (disjunc- 
tion) and + (exclusive or, or "xor"). These connectives operate on Boolean 
functions, that is, on elements of B. Traditionally they are overloaded to also 
operate on truth values, that is, elements of ,B. However, we deviate at this point, 
as the distinction between xor and its "bit-wise" analogue will be critical in what 
follows. Hence we denote the B (bit) version by ®. We extend this to valuations 
and bit-strings in the natural way: 

(Ml ® M2) {x) = Ml {x) © M2 (x) 

and we let ©3 denote the "xor of three" operation A/xiM2/U3./ii © M2 © A*3- We 
follow Zanuttini [12] in further overloading '®' and using the notation 

= /i ® M = {ju © /i' I m' e M} 

We read as "M translated by /x". Note that for any set M, the function 
Xji.Mf^ is an involution: (A'/^)^ = M . 

A final overloading results in the following definition. For <p S B, and fi &I, 
let </3 © M = fn{Mfj) where M = models{ip). 
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2.2 The afflne class 



An affine function is one whose set of models is closed under pointwise applica- 
tion of ®3 [10]. Afhne functions have a number of attractive properties, as we 
shall see. Syntactically, a Boolean function is affine iff it can be written as a 
conjunction of affine equations 

CiXi + C2X2 + . . . + CkXk = Co 

where Ci S {0, 1} for all i G {0, .., k}.^ This is well known, but for completeness 
we prove it below. 

The affine class contains 1 and is closed under conjunction. Hence the concept 

of a (unique) affine envelope is well defined, and the operation of taking the affine 
envelope is an upper closure operator [8]. For convenience, let us introduce a 
name for this operator: 

Definition 1. Let y be a Boolean function. The affine envelope, aff{(p), of ip is 
defined: 

o-ffiv) — A'f-^ \ V \='>P and V' is affine} ■ 

There are numerous other classes of interest, including isotone, antitone, Krom, 
Horn, fc-Horn [4], and fc-quasi-Horn functions, for which the concept of an enve- 
lope is well-defined, as they form upper closure operators [9].^ 

Zanuttini [12] exploits the close connection between vector spaces and the 
sets of models of affine functions. For our purposes we call a set S of bistrings a 
vector space iff G 5 and S is closed under ©. The next proposition simplifies 
the task of doing model-closure under ©3. 

Proposition 1 ([12]). A non-empty set of models M is closed under ©3 iff 
is a vector space, where fi is any element of M. 

Proof: Let /z be an arbitrary element of M. Clearly contains 0, so the right- 
hand side of the claim amoimts to being closed under ©. 

For the 'if direction, assume is closed under © and consider /zi, /i2, /^3 G 
M. Since n® H2 and ijl® ^j,^ are in M^^, so is 1x2 © Ms- And since furthermore 
is in Af^, so is © /^i © ^2 ffi A^3- Hence /ii © /i2 © ^3 is in Af. 

For the 'only if direction, assume M is closed under ©3, and consider ^1,^2 G 
M^. All of /i, /U © Hi and jJL® H2 are in M, and so © (/i © /xi) © (/U © ^12) = 
IJ.® Hi® H2 & M. Hence fii ® iJ,2 € M^. ■ 

^ In some circles, such as cryptography/coding community, the term "affine" is used 
only for a function that is or 1, or can be written cixi + C2X2 + . . . + CkXk + co (the 
latter is what Post [7] called an "alternating" function). The resulting set of "affine" 
functions is not closed under conjunction. 

^ Popular classes such as unate functions and renamable Horn are not closed under 
conjunction and therefore do not have well-defined concepts of (unique) envelopes. 
For example, a; — > 3/ and x -t— y both are unate, while a; <-> y is not, so the "unate 
envelope" of the latter is not well-defined. 
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Proposition 2. A Boolean function is affine iff it can be written as a conjunc- 
tion of equations 

CiXi + C2X2 + . . . + CkXk = Co 

where Ci £ {0, 1} for all i € {0, .., k}. 

Proof: Assume the Boolean function ip is given as a conjunction of equations of 
the indicated form and let /ii, /i2 and ns be models. That is, for each equation 
we have 

Ci^l(xi) + C2lJ.l{x2) + . . . + Cfe^i(xfc) = Co 
Cl^J.2{Xl) + C2/U2(X2) + . . . + Ckfl2{Xk) = Cq 
Clll3{xi) + C2H3{X2) + . . . + CklJ.3{Xk) = Co 

Adding left-hand sides and adding right-hand sides, making use of the fact that 
'•' distributes over '-|-', we get 

Ci/i(xi) -I- C2I^{X2) -I- ... -I- CklJ.{Xk) = Co -I- Co -I- Co = Co 

where /i = /ii ® /i2 © Ms- As M thus satisfies each equation, /i is a model of ip. 
This establishes the 'if direction. 

For the 'only if part, note that by Proposition 1, we obtain a vector space 
from any non-empty set M closed under ©3 by translating each element of 
M hy n € M. A basis for can be formed by taking one vector at a time from 
and adding it to the basis if it is linearly independent of the existing basis 
vectors. From this basis, a set of linear equations 

aiiXi © • • • © aikXk = 
a2ia;i ® • • • ® a2kXk = 

ajixi ® • • • ® ajkXk = 

can be computed that have exactly as their set of models (a method is 
provided by Zanuttini [12], in the proof of his Proposition 3). Each function 
fi = Xxi, . . . ,Xk-aiiXi © •■• © aikXk is linear, so for v G M^, /^{ly © /x) = 
-|- = fiin)- Hence M can be described by the set of affine equations 

auxi ® • • • ® aikXk = /i(m) 
a2ia;i ® • • • ® a2kXk = /2(m) 

ajiXi © • • • ffi ttjkXk = fjin) 
as desired. ■ 

It follows from the syntactic characterisation that the number of models pos- 
sessed by an affine function is either or a power of 2. Other properties will now 
be established that are used in the justification of the affine envelope algorithm 
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of Section 3. The first property is that if a Boolean function has two models 
that differ for exactly one variable v, then its afBne envelope will be indepen- 
dent of V. To state this precisely we introduce a concept of a "characteristic" 
valuation for a variable. 

Definition 2. In the context of a set of variables V, let v gV. The characteristic 
valuation for v, Xv, is defined by 

, , ( 1 ii X = V 
Xv[x) = Otherwise ■ 

Note that ij,(BXv is the valuation which agrees with /z for all variables except v. 
Moreover, if |= then both of /x and ij,(BXv are models of 3v{ip). 

Proposition 3. Let <fhea. Boolean function whose set of models forms a vector 
space, and assume that for some valuation /x and some variable v, fi and fi®Xv 
both satisfy (p. Then (p is independent of v. 

Proof: The set M of models contains at least two elements, and since it is closed 
under ®, Xv is a model. Hence for every model v of (p, ^(BXv is another model. 
It follows that (f is independent of v. m 

Proposition 4. Let ip he a Boolean function. If, for some valuation ^ and some 
variable u, /i and fj, © Xv both satisfy ip, then a,ff{ip) = 3v{aff(ip)). 

Proof: Let /U be a model of p, with fi © Xm ^^^^ ^ model. For every model v of 
<p, we have that i' © ^ ffi © Xv) satisfies aff{ip), that is, v © Xv H '^■ffif )- 
since both u and G) Xv satisfy aff{ip), it follows that 3v(aff{p)) cannot have a 
model that is not already a model of aff{(p) (and the converse holds trivially). 
Hence ajfiip) = 3v{aff{<p)). m 

Proposition 5. For all Boolean functions (p, aff{3v{ip)) — 3v{aff{ip)). 

Proof: We need to show that the models of aff{3v{p)) are exactly the models of 
3v{ajf{p)). Clearly qff{3v{p)) is iff is iff 3v{aff{p)) is 0. So we can assume 
that aff{3v{(f)) is satisfiable — let fj, \= ajf{3v{(f)). Then, for some positive odd 
number k, 

M = /il © M2 © • • • © Mfc 

with jii,...,jik being different models of 3v{ip>). These k models can be parti- 
tioned into two sets, according as they satisiy let 

M = {m\l<i<k,Hi\=ip} M' = {in\l<i<k,Hi\l=ip} 

Then both M and M^^ consist entirely of models of ip. Hence, depending on the 
parity of M's cardinality, either /i or /x © is a model of aff{ip) (or both are). 
In either case, ji \= 3v{aff{ip)). 

Conversely, let /u |= 3v{aff{ip)). Then either /x or © x„ is a model of aff{ip) 
(or both are) . Hence /i (or /i © Xy as the case may be) can be written as a sum 
of k models in , . . . , iik {k odd) of (p. It follows that both /^i © /X2 © ■ • ■ © /Xfe and 
Ml ® ® • • • ® M's ® are models of 3v{ip). Hence m 1= o-ff{^v{ip))). u 
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2.3 ROBDDs 



We briefly recall the essentials of ROBDDs [3]. Let the set V of propositional 
variables be equipped with a total ordering -<. Binary decision diagrams {BDDs) 
are defined inductively as follows: 

- is a BDD. 

- 1 is a BDD. 

- If a; e V and Ri and R2 are BDDs then ite(a;, i?2) is a BDD. 

Let R = ite(x, Ri^R^). We say a BDD R' appears in RiS R' = R or R' appears 
in Ri or R2. We define vars(ii) = {v \ \te{v, _, _) appears in R}. The meaning of 
a BDD is given as follows. 

[01 =0 
[11 =1 

Dteix, Ri,R2)j = ixA [i?i]) V A IR2}) 

A BDD is an Ordered binary decision diagram (OBDD) iff it is or 1 or if it is 
ite(x, i?2), Ri and R2 arc OBDDs, and \fx' e vars(i?i) U vars(i?2) '■ x ^ x' . 

An OBDD R is a Reduced Ordered Binary Decision Diagram (ROBDD [2, 3]) 
iff for all BDDs i?i and R2 appearing in R, Ri = R2 when = [-R21- Practical 
implementations [1] use a function mknd(a;, i?2) to create all ROBDD nodes 
as follows: 

1. If i?i = 7?2! return i?i instead of a new node, as |ite(.x, i?2)] = [RiJ- 

2. If an identical ROBDD was previously built, return that one instead of a new 
one; this is accomplished by keeping a hash table, called the unique table, of 
all previously created nodes. 

3. Otherwise, return ite(a;, -Ri, i?2)- 

This ensures that ROBDDs are strongly canonical: a shallow equality test is suf- 
ficient to determine whether two ROBDDs represent the same Boolean function. 

Figure 2 shows some example ROBDDs. The ROBDD in Figure 2(a) denotes 
the function which has five models: {00011, 00110, 01001, 01101, 10101}. In gen- 
eral we depict the ROBDD ite(a;, i?2) as a directed acyclic graph rooted in 
X, with a solid arc from x to the dag for i?i and a dashed line from x to the dag 
for i?2 . However, to avoid unnecessary clutter, we omit the node (sink) for and 
all arcs leading to that sink. 

As a typical example of an ROBDD algorithm. Algorithm 1 generates the 
disjunction of two given ROBDDs. This operation will be used by the affine 
approximation algorithm presented in Section 3. 

Algorithm 2 is used to extract a model from an ROBDD. For an imsatis- 
fiable ROBDD (that is, 0) we return _L. Although presented here in recursive 
fashion, it is better implemented in an iterative manner whereby we traverse 
through the ROBDD, one pointer moving down the "else" branch at each node, 
a second pointer trailing immediately behind. If a 1 sink is found, we return the 
path traversed thus far and note that any further variables which we are yet to 
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Algorithm 1 The "or" operator for ROBDDs 

or(l,-) = 1 
or(0, _) = 
or(., 1) = 1 
or(_, 0) = 

or(ite(x,r, E),\te{x',T',E')) 

\x^x' = mknd(,2-, or(T. ite(x', T' , E')), or(E, ite(x', T' , E'))) 
\x' ~<x = mknd{x' ,or{\te{x,T, E),T'),or{\te{x,T,E), E')) 
I otherwise = mknd(a:, or(T, T'), or(£;, S')) 



Algorithm 2 get_model algorithm for ROBDDs 
get_model(0) — _L 
get_model(l) — Xv.l. 
get_model(ite(a;,T, £)) = 

let fj, = get_model(r) 

in 

if /^ = _L then 

get_model(£)[a; >->■ 0] 
else nlx H- > 1] 



encounter may be assigned any value. If a sink is found, we use the trailing 
pointer to step up a level, follow the "then" branch for one step and continue 
searching for a model by following "else" branches. This method relies on the 
fact that ROBDDs arc "reduced" , so that if no 1 sink can be reached from a 
node, then the node itself is the sink. 

We shall later use the following obvious corollary of Proposition 3: 

Corollary 1. Let ROBDD R represent a function whose set of models form a 
vector space. Then every path from R's root node to the i-sink contains the 
same sequence of variables, namely vars(i?) listed in variable order. ■ 

It is important to take advantage oi fan-in to create efficient ROBDD algorithms. 
Often some ROBDD nodes will appear multiple times in a given ROBDD, and 
algorithms that traverse that ROBDD will meet these nodes multiple times. 
Many algorithms can avoid repeated work by keeping a cache of previously seen 
inputs and their corresponding outputs, called a computed table, see Brace et 
al. [1] for details. 

3 Finding AfRne Envelopes for ROBDDs 

Zanuttini [12] gives an algorithm, here presented as Algorithm 3, for finding the 
affine envelope, assuming a Boolean function (f is represented as a set of models. 
This algorithm is justified by Proposition 1. 
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Algorithm 3 The sets-of-models based afEne envelope algorithm 



Input: The set M of models for function (p. 

Output: aff(M) — tlie set of models of i^'s afEne envelope. 

if M = then 

return M 
end if 
iV<-0 
choose fj, € M 
New <— 
repeat 

N ^ NU New 

New ^ {mi ® M2 I Ml, M2 € AT} \ AT 
until New = 
return iVp 



M 



( 01011 1 
01100 
10111 
11001 



M = 01100 



00111 
00000 
11011 
10101 



N 



( 00111 1 
00000 
11011 
10101 
11100 
10010 
OHIO 
01001 



Ni, = afF(M) = < 



( 01011 ^ 
01100 
10111 
11001 
10000 
11110 
00010 
00101 



Fig. 1: Steps in Algorithm 3 



Example 1. To see Algorithm 3 in action, assume that ip has four models, M = 
{01011, 01100, 10111, 11001}, and refer to Figure 1. We randomly pick jjl = 01100 
and obtain A'/^ as shown. The first round of completion under adds three 
bit-strings: {11100, 10010,01110}, and another round adds 01001 to produce A^. 
Finally, "adding back" n = 01100 yields the afBne envelope = aff(M). ■ 

Wo arc interested in developing an algorithm for ROBDDs. We can improve 
on Algorithm 3 and at the same time make it more suitable for ROBDD manip- 
ulation. The idea is to build the result TV step by step, by picking the models ly 
of one at a time and computing := TV U N,^ at each step. We can start 
from N — {0}, as has to be in M^. This leads to Algorithm 4. 

This formulation is well suited to ROBDDs, as the operation N^, that is, 
taking the xor of a model v with each model of the ROBDD N can be im- 
plemented by traversing N and, for each v-node with v{v) = 1, swapping that 
node's children. And we can do better, utilising two observations. 

First, during its construction, there is no need to traverse the ROBDD N for 
each individual model v. A full traversal of N will find all its models systemati- 
cally, eliminating a need to remove them one by one. 
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Algorithm 4 A variant of Algorithm 3 
Input: The set M of models for function (p. 
Output: aff(M) — the set of models of i/j's afEne envelope, 
if M = then 

return M 
end if 
iV^{0} 
choose fi £ M 
R^M^\ {0} 
for all u £ R do 

N ^ NUN^ 
end for 
return iV^ 



Second, the ROBDD being constructed can be simplified aggressively dur- 
ing its construction, by utilising Propositions 4 and 5. Namely, as we traverse 
ROBDD R systematically, many paths from the root to the i-sink will be found 
that do not contain every variable in vars(i?). Each such path corresponds to a 
model set of cardinality 2*^, k being the number of "skipped" variables. Proposi- 
tion 4 tells us that, eventually, the afhne envelope will be independent of all such 
"skipped" variables, and Proposition 5 guarantees that variable elimination can 
be interspersed arbitrarily with the process of "xoring" models, that is, we can 
eliminate variables aggressively. 

This leads to Algorithm 5. The algorithm combines several operations in 
an effort to amortise their cost. We present it in Haskell style, using pattern 
matching and guarded equations. In what follows we step through the details of 
the algorithm. 

The to_afF function finds an initial model /i of i?, before translating i?, through 
the call to translate. This initial call to translate has the effect of "xor-ing" n 
with all of the models of R. Once translated, the xor closure is taken, before 
translating again using the initial model /i to obtain the affinc closure. 

translate is responsible for computing the xor of a model with an ROBDD. 
Its operation relies on the observation that for a given node v in the ROBDD, if 
fi{v) = 1, then the operation is equivalent to exchanging the "then" and "else" 
branches of v. 

xor.close is used to compute the xor-closure of an ROBDD R. The third 

argument passed to trav is an accumulator in which the result is constructed. 
As in Algorithm 4, we know that will be a model of the result, so we initialise 
the accumulator as (the ROBDD for) /\{v \ v e vars(i?)}. 

trav implements a recursive traversal of the ROBDD, and when a model is 
found in /x, we "extend" the afhne envelope to include the newly found model. 
Namely, extend (i?, 5, /it) produces (the ROBDD for) i? V 5^. Note that once a 
model is found during the traversal, trav checks if /i is already present within 
the xor-closure, and if it is not, invokes extend accordingly. This simple check 
avoids making unnecessary calls to extend. 
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Algorithm 5 AfEne envelopes for ROBDDs 

Input: An ROBDD R. 

Output: The aifine envelope of R. 

to_afF(0) = 

to_afF(i?) = let /I = get_model(i?) in translate(xor_close(translate(7?, /n)), /x) 

translate(0, _) = 
translate(f , -) = 1 
translate(ite(a;, T, E), fi) 

I {n{x) = 0) = cons(x,translate(T, /i), translate(-E, /i), /j) 

I (//(a^) = 1) = cons(a;,translate(S, //),translate(T, 

xor_close(i?) = trav(7i, Aw.-L, | v £ vars(7i)}) 

trav(0, -,S) = S 
trav(l, jj,, S) 

\{phS)=S 

I otherwise = extend (S, 5", /j,) 
trav{\te{x,T,E),ii,S) = trav(T, 1], trav {E, fj,[x <->■ 0], S)) 

cons{x,T, E, fi) 

\{p{x)^±)^oriT,E) 

I otherwise = mknd(a:, T, _£) 

extend(l, _,_) = ! 
extend(_, 1, _) = 1 
extend (0, S", /i) = translate(S', ^) 

extend(ite(a::, T, 0, pi) — cons(a;, extend (T, 0, /i), extend 0, /u), /i) 
extend(ite(x, T, ^;), ite(a;, T' , E'),fi) 

I (/i(a;) = 1) = mknd(a;,extend(r,£',^),extend(S,r',^)) 

I otherwise = cons(a;,extend(r, T',//),extend(iJ, 



The cons function represents a special case of mknd. It takes an additional 
argument in /n and uses it to determine whether to restrict away the correspond- 
ing node being constructed. The correctness of cons rests on Propositions 4 and 
5, which guarantee that afhne approximation can be interspersed with variable 
elimination, so that the latter can be performed aggressively. 

Finally, once a model is found during a traversal, extend is used to build up 
the affine closure of the ROBDD. In the context of the initial call extend(S', S, n), 
Corollary 1 ensures that the pattern of the last equation for extend is sufficient: 
If neither argument is a sink, the two will have the same root variable. 

Example 2. Consider the ROBDD R (shown again in Figure 2(a)), whose set 
of models is {00011,00110,01001,01101, 10101}. Picking fi ^ 00011 and trans- 
lating gives R/j,, shown in Figure 2(b). This ROBDD represents a set of vectors 
{00000, 00101, 01010, OHIO, 10110} which is to be extended to a vector space. 
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Fig. 2: (a): An example ROBDD R; note that all our ROBDD diagrams leave out the 
0-sink and all arcs to it. (b): The translated version Ji^. (c): The vector space S that 
has been extended to cover 00101. 




Fig. 3: (a): The vector space S after being extended to cover 01X10. (b): S after 
extending to cover 10110. (c): S translated to give the afBne closure of R. 

The algorithm now builds up S, the xor-closurc of i?^, by taking one vector 
t; at a time from and extending 5 to a vector space that includes v. S begins 
as the zero vector. 

The first step of the algorithm just adds 00101 to the existing zero vector 
(Figure 2(c)). The next step comes across the vector 01X10 (which actually 
represents two valuations) and existentially quantifies away the variable x (Fig- 
ure 3(a)). Note that the variable z also disappears: this is due to the extension 
required to include 01X10 that adds enough valuations such that z is "covered" 
by the vector space. 
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Algorithm 6 Generation of random Boolean functions as ROBDDs 
Input: The number n of variables in the random function, 

pr a caUbrator set so that the probability 

of a valuation being a model is 2~^^. 
Output: A random Boolean function represented as an ROBDD. 

gen_rand_bdd(n,pr) = rand_bdd(0,n — l,pr) 

rand_bdd('m, n, pr) 

I (m — n) = mknd(m, rand_sink, rand_sink) 
I otherwise = mknd(m, T, _B) 
where 

T = if (m > n — pr) A cointoss() then rand_bdd(m + 1, n, pr) else 
E = if {m > n — pr) A cointoss() then rand_bdd(m + 1, n,pr) else 

rand_sink= if cointoss() then else 1 

cointoss() returns i or with equal probability. 



Extending to cover 10110 simply requires every model to be copied, with v 
mapped to 1 (Figure 3(b)). Finally, translating back by /x produces A, the afSne 
closure of R, shown in Figure 3(c). ■ 

4 Experimental Evaluation 

To evaluate Algorithms 3 and 5 we generated random Boolean functions using 
Algorithm 6. We generated random Boolean functions of n variables, with an 
additional parameter to control the density of the generated function, that is. 
to set the likelihood of a random valuation being a model. For Algorithm 3 we 
extracted models from the generated ROBDDs, so that both algorithms were 
tested on identical Boolean functions. 

gen_rand_bdd(n,pr) builds, as an ROBDD R, a random Boolean function 
with the property that the likelihood of an arbitrary valuation satisfying R is 
2~P^. It invokes rand_bdd(0, n— l,pr). This recursive algorithm builds a ROBDD 
of {n—pr) variables and at depth {n—pr), a random choice is made as to whether 
to continue generating the random function or to simply join the branch with 
a sink. If the choice is to continue, then the algorithm recursively applies 
rand_bdd(m + l,n,pr) to the branch. 

By building a "complete" ROBDD of (n — pr) variables, we were able to 
distribute the number of models for a given number of variables. In this way, we 
were able to compare the various algorithms for differing model distributions. 

Table 1 shows the average time (in milliseconds) taken by each of the algo- 
rithms over 10,000 repetitions with the probability 1/1024 of a valuation being 
a model. Timing data were collected on a machine running Solaris 9, with two 
Intel Xeon CPUs running at 2.8GHz and 4GB of memory. Only one CPU was 
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Variables 


Algorithm 3 


Algorithm 5 


12 


0.021 


0.017 


15 


5.991 


0.272 


18 




0.407 


21 




1.710 


24 




14.967 



Table 1: Average time in milliseconds to compute one affine envelope 



used and tests were run under minimal load on the system. Our implementa- 
tion of Algorithm 3 uses sorted arrays of bitstrings (so that search for models 
is logarithmic). As the number of models grows exponentially with the number 
of variables, it is not surprising that memory consumption exceeded available 
space, so we were unable to collect timing data for more than 15 variables. 

5 Conclusion 

Approximation and the generation of envelopes for Boolean formiilas is used ex- 
tensively in the querying of knowledge bases. Previous research has focused on 
the use of Horn approximations represented in conjunctive normal form (CNF). 
In this paper, following the suggestion of Zanuttini, we instead focused on the 
class of afhne functions, using an approximation algorithm suggested by Zanut- 
tini [12]. Our initial implementation using a naive sets-of-models (as arrays of 
bitstrings) representation was disappointing, as even for functions with very few 
models, the afhne envelope often has very many models (in fact, the affine enve- 
lope of very many functions is 1), so storing sets of models as an array becomes 
prohibitive even for functions over rather few variables. 

ROBDDs have proved to be an appropriate representation for many applica- 
tions of Boolean functions. Functions with very many models, as well as very few, 
have compact ROBDD representations. Thus we have developed a new affine en- 
velope algorithm using ROBDDs. Our approach is based on the same principle 
as Zanuttini's, but takes advantage of some useful characteristics of ROBDDs. In 
particular. Propositions 4 and 5 allow us to project away variables aggressively, 
often significantly reducing the sizes of the representations being manipulated 
earlier than would happen otherwise. 

Zanuttini [12] suggests an affine envelope algorithm using modulo 2 congru- 
ence equations as output, and proves a polynomial complexity bound. However, 
we preferred to use ROBDDs. As a functionally complete representation for 
Boolean functions, ROBDDs allow the same representation for input and out- 
put, keeping the algorithms simple. For example, the algorithm for evaluating 
whether one ROBDD entails another is very straightforward, whereas evaluat- 
ing whether a set of congruence equations entails a Boolean function in some 
other representation would be more complicated. It also means that systems 
which repeatedly construct an affine approximation, then manipulate it as a 
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general Boolean function, and then approximate this again, can operate with- 
out having to repeatedly convert between different representations. Importantly 
for our purposes, computing envelopes as ROBDDs permits us to use the same 
representation for approximation to many different Boolean classes. 

Further research in this area includes implementing Zanuttini's suggested 
modulo 2 congruence equations representation and comparing to our ROBDD 
implementation. This also includes evaluating the cost of determining whether a 
set of congruence equations entail a given general Boolean function. We also will 
compare afRne approximation to approximation to other classes for information 
loss to evaluate whether afBne functions really are as suitable for knowledge-base 
approximations as Horn or other functions. 
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